Quantum Solution to the Extended Newcomb's Paradox 
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We regard the Newcomb's Paradox as a reduction of the Prisoner's Dilemma and search for the 
considerable quantum solution. The all known classical solutions to the Newcomb's problem always 
imply that human has freewill and is due to the unfair set-up (including strategies) of the Newcomb's 
Problem. For this reason, we here substitute the asymmetric payoff matrix to the general form of 
the payoff matrix (M) and consider both of them use the same quantum strategy. As a result we 
obtained the fair Nash equilibrium, which is better than the case using classical strategies. This 
means that whether the supernatural being has the precognition or not depends only on the choice 
of strategy. 

PACS numbers: 



I. INTRODUCTION 

A quantum game theory is a natural extension to the 
quantum world of the ubiquitous classical game theory. 
Von Neumann pointed out 1], a game is simply the to- 
tality of the rule which describe it. The elements of the 
rule such as the rational choices (strategies) are one of the 
main component of a general game. Moveover a game as- 
sumed that each individual player is trying to maximize 
his own advantage, without concern for the well-being of 
the other players. After the theory of the ganeral game 
has been broken down into details we can meet sev- 
eral games to play more efficiently for his profits. For 
example, von Neumann's the mathematical approaches 
help us rational decision for the economic behavior, which 
strategies are fit and what's the causal relationship. The 
Darwin's theory for the 'survival of fittest' is maybe the 
another well-known game, the evolutionary theory, de- 
scribe the biological evolution in nature, it is described 
by the Evolutionarily Stable Strategy (ESS) 0]. 

Here, we present two games known as the Prisoner's 
dilemma(PD) ||] and the Newcomb's Paradox(NP) [||. 
Those game have some irrational consequence, i.e., the 
players cannot concurrently increase their payoff using a 
certain strategy in the PD. Also in the NP case, the su- 
pernatural being(5S)'s prophecy fail to facing human's 
freewill surprisingly. Game theorists maybe call this am- 
biguous situation as a dilemma or paradox for a given 
competition. We will solve the NP under the well-known 
PD solution, which means the PD's symmetric consider- 
ation helpful to grasp the NP. 

The classical games, as long as its history, form a var- 
ious kinds of game contained certain rules and it also 
have studied deeply. Its solutions of the dilemmatic view 
point are generally solved by the Nash equilibrium gov- 
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erned by a dominant strategy. While the usual quantum 
games 0, Q have comparatively short history, but their 
solution shows the similar Nash equilibrium pattern — 
the changed Nash equilibrium can be compared to the 
classical one. Meyer has already proved that an op- 
timal quantum strategy in a two-person zero-sum game 
surpass against to an optimal classical (mixed) strategy. 
If the participant, Alice, is using a probabilistic (mixed) 
quantum strategies against to Bob's the deterministic 
classical strategies, then Alice always win to Bob on the 
game. Furthermore a two-person static game has a Nash 
equilibrium for the quantum versus quantum strategy. 
For consider the NP, SB's deterministic prophecy gives 
an unfair situation for the human's probabilistic strate- 
gies. So we need to reconstruct the original payoff to the 
new reasonable payoff structure. 



II. CLASSICAL STRATEGY AND ITS 
SOLUTION 

A. NP and PD in the Classical Game 

In the PD each of the players, Alice and Bob, must in- 
dependently decide whether they choose to defect (strat- 
egy D) or cooperate (strategy C). Depending on their 
decision, each player receives a certain payoff 11^. The 
strategy D is the dominant and will be in equilibrium. 
I.e., = {D,D} and {UA{^A),'nB(.^B)} = 

{1, 1} in this game. A dominant strategy for Alice 
is defined by a strategy such that the payofll Ha 
has the property Ua^'^a,^-' b) > Ha{^^a,^-'b) for all 
€ S'ajS^b € Sb provided such a strategy exists, 
where i,j € N. The strategy set Sk — {^^k} con- 
tains the all strategies i G N, where N is the number 
of the all possible strategies in the game and k denote 
the players A or B. Moreover Nash equilibrium, corre- 
sponding to the payoff {1, 1}, is a combination of strate- 
gies {S^jSb} such that neither party can increase his 
or her payoff by unilaterally departing from the given 
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TABLE I: The payoff matrix for the traditional Prisoner's 
Dilemma. The first entry in the round bracket denotes the 
payoff of Alice and the second one the payoff of Bob. 

Bob: C Bob: D 

Alice: C (3, 3) (0, 5) 

Alice: D (5, 0) (1, 1) 



TABLE IIL The payoff matrix for the extended Newcomb's 
Paradox. The entry S{< 0) restrict the human's choice. 
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TABLE H: The payoff matrix for the original Newcomb's 
Paradox. The elements of the matrix will give to human for 
his reward. 







SB: Boxl 


SB: Box2 


Human: 


Boxl 


$1 000 000 





Human: 


Box2 


$1 001 000 


$1 000 



equilibrium point i.e., UaC^Aj^b) > ^aC^^ a,^b) and 
IIb{^At^b) > ^ni'^A,'^'' b)- Another main concept of 
a game theory is the Pareto optimality, a pair of payoffs 
{Syi, Sb} — {3, 3}, if it is not jointly dominated by an- 
other point, and if neither party can increase his or her 
payoff without decreasing the payoff to the other party. 
Someone perhaps ask that the strategy {C, C} is more 
rational decision. But the dominant strategy D controls 
the PD. 

The NP is said to be a one-person asymmetric game 
such that the human's freewill conflict against to the 5S's 
prophecy. That is, given two boxes, Boxl and Box2, 
Box2 which contains $1 000 and Boxl which contains 
either nothing or $1 000 000, the human may pick ei- 
ther Boxl or both. However, at some time before the 
choice is made by human, SB has predicted what the hu- 
man's decision will be made 7] . Thereafter SB will fill up 
the Boxl with $1 000 000 if SB predicts human to take 
it, or with nothing if SB predicts human to take both 
boxes. The constructionally important notion of the NP 
is no other than its one-person game as the restricted 
two-person game PD's Q. We can see the original asym- 
metric payoff matrix for human in Table |nj 



B. Extended form of NP and PD 

Von Neumann referred to the fact that game theory 
only concern with the relation of the elements of the 
payoff matrix. So that in this paper, we can consider 
the extended version of the NP and PD. The general 
form of the corresponding payoff matrix is denoted 

f a 

by = Ml + M2 where Mi ^ 
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and 



M2 



S 

(3 



, M2 will be considered as two special 



case, an asymmetric payoff matrix and the quasi-skew 
symmetric matrix. Moreover we fix the matrix elements, 
in the payoff matrix, along to[3>a>^>5 and 5 < 0. 



(i) Asymmetric payoff matrix A: 
satisfied some next property 

A = Ai+A2 



where Ai ^ Mi is a diagonal matrix, A2 — 



If the matrix M is 



5 

P 



(1) 



A2 is an off-diagonal asymmetric matrix with \5\ < /3, 
then we call the matrix A as the asymmetric. Specially, 
the case oi S = occurs the original NP and PD. 

(ii) Quasi-skew symmetric payoff matrix S : The payoff 
matrix M is defined as a quasi-skew symmetric matrix 
S i.e., decomposed by 



S = Si+S2 



(2) 



where iSi = A^i is a diagonal matrix, ^2 



IS 





(3 

the skew-symmetric matrix which satisfies the property 
^2"^ — —S2. If we generally allow the extended payoff 
matrix M to the traditional NP, then one can naturally 
extend the matrix as corresponding two-person symmet- 
ric PD in Table HTllIVI As we see previously, the payoff 
matrix of the symmetric PD is more clear to interpret 
and easy to calculate the solution. Thereafter we will 
change the formation from the two-person's PD to the 
one-person's NP. Here we must consider a sense of the 
leaping element S in the matrix M. The payoff values, all 
elements of the matrix for human in the original NP, have 
always a positive value more than 0. This means that the 
payoffs are only beneficial to human, moreover he does 
cheat SB using the convex combination of the classical 
mixed strategies. The set-up is unreasonable and one- 
sided i.e., trying wouldn't do any harm for human. If 
we give a negative payoff element — /3, the case (ii), for 
the player such as SB^s penalty for human's cheating or 
human's entry fee, perhaps some anticipointment for an 
untruth of SB^s prophecy. Likewise the solution for the 
PD, the element S ^ —f3 also gives a corresponding Nash 
equilibrium. We adapt the Nash equilibrium consider as 
the quantum solution to the NP i.e., if human choose a 
then SB can exactly predict the human's choice a. 



III. QUANTUM STRATEGY 
A. Quantum Strategies and Quantum Circuit 

A general quantum game F is consisted of the player 
k, strategy T,k & Sk C Uk = tJki&T^') and payoff Hfe 
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TABLE IV: The payoff matrix for tlie extended Prisoner's 
Diiemma. Alice correspond to human and Bob to SB, aiso C 
change to the Boxl strategy and D equai to the Box2. 







SB: Boxl 


SB: Box2 


Human: 


Boxl 


(a, a) 


{3, P) 


Human: 


Box2 


{13, S) 


(7, 7) 



like as the general classical game |g|]. But the strate- 
gic space Sk is comparatively large space chosen from U , 
SkCtfkQU,U= {U{8,<P)\d e [0,7r] and </> £ [0,7r/2]} 
in Hilbert space H, compare to the classical determin- 
istic space Sk- The quantum games are following the 
next trivial stages. First, the well-known initial states 
are prepared by a umpire or the public renouncement, 
because the fairness of the game is ensured by its open 
source of the payoff. In PD, we set the initial state as 
two qubit state |00). Next step, both player conflict to 
win the game with the given strategy set Sk C Uk, which 
is the quantum mechanical operation. Main point of a 
game maybe make out what is the best strategy in the 
confliction. Finally, the last step is some detection of the 
game result by quantum measurement. 

We denote the initial state of the game by |?/'o) — J\00), 
where J is a unitary operation executed by an umpire and 
the adjustment of the parameter r gives a separable or 
entangled states. The unitary operation J is described 
by J = ebru»um^ where t e [0,7r/2] and U S U{e,(t)) 
stand for the selected operation by an umpire. In fact, r 
is said to be a measure for the game's entanglement i.e., 
T = 7r/2 means the maximally entangled state. If we have 
prepared the initial game setup such as the separable or 
entangled state, then we can paly PD in each player's 
strategic space Sk- Each player may execute own's qubit 
by Sfc £ Sk for increment of his payoff, of course the 
player let to moves secretly. It provide to be sufficient to 
restrict the strategic space to the two-parameter set of 
2x2 matrices, 

rytn (e'^cos9/2 sin 0/2 \ 
^(^''^)=(, -sin 0/2 e-^ cos 0/2 ) 

where < < tt and < < 7r/2. From the result of the 
Eisert et al. ■, where various kinds of strategy are 

introduced, we can find the general quantum leaps in the 
game theory. Specially the strategy Q®Q= C/(0, 7r/2) ® 
C/(0, 7r/2) give a new Nash equilibrium, against to clas- 
sical strategy D ® D = U{tt, 0) U{tt, 0). Here, we will 
briefiy depict the evolution of qubit state. The prepared 
initial state of the two qubit, to play on the game, is 
denoted by |'0o) = J|00). Next the participants, Alice 
and Bob, play own's qubit using the unitary operation 
C/fc. The state of the game is prepared, after the previous 
strategic process, like that 

\^bf) ^ P{Ua®Ub)\4'o) (4) 
where \ijjf) means the final state of the game, see figure in 



P|. Before we measure the final state, the Alice's payoff 
can be expected as 11^ = ^^06 ~^ l^^bc ~^ ^^cb + '^^bb 
where Pj^^±^ — is the measurable prob- 

ability with corresponding payoff as the coefficient re- 
spectively. If the each qubit is measured by appropriate 
device, then each player obtains the corresponding payoff 
which is a resultant value of the quantum PD. For exam- 
ple, the case of r = 7r/2 and Q ® Q build up to the new 
Nash equilibrium {a, a} from the classical equilibrium 
{7,7}. That is, the classical Pareto optimal correspond 
to the new stable point of the PD. 

Now we will compute the new Nash equilibrium in the 
PD, which is occurred by the some additional quantum 
strategy such as Walsh-Hadamard transformation H, 

or Pauli operations 

= (1 0)' = V)' '^^^ (0 -i)- 

First of all the initial state |^o) let to be the maximally 
entangled state, changing the variational value r from 
to tt/2 and the choice of U. For instance, if the en- 
tangled measure r fix to 7r/2 and the umpire's choice U 
select to D ® D, then the value J is locked up 0, 
like that J = e^^^D®D/i) ^ ^(/®2 + ib®-^)^ where / 
means the identity operation. Therefore, we obtain the 
maximally entangled state, denoted by jV'o) = ^|00) = 
-^(|00) and sequentially get to the separable fi- 

nal state 1^/"/), by the disentangling operation after the 
local operations Ua — D and Ub — D are done of each 
player respectively i.e., \i)f) = ^J'f(lll) -hi|00)) = |11). 
Although we perform the measurement by a certain ob- 
servable ^A(B) with the combinations of 

{a|00)(00|,,5(/3)|01)(01|,/3(,5)|10)(10|,7|ll)(ll|}, (7) 
the Nash equilibrium, 

{Ua - ir{SlA\^f){i^f\), 

tiB=ir{(lB\4>f){4>f\)} = {l,l} (8) 

with the probability 1, is obtained on this traditional 
strategy. This is only the classical consequence. 

But if we choose the some less quantum strategy and 
full quantum strategies such as the Hadamard transfor- 
mation H and Pauli matrices {ax,cry,az}, then the new 
Nash equilibrium is created, that is, we will choose the 
extended strategic space S. Given the entangled state 

|V^o) = ^(|00)-z|ll)) (9) 

with J — e}-'^('^v®'^v')l'^ transform to the final state via the 
local operation Uk = Hk G Uk{(^, 4>), 

\^f) =i(|00)-|01)-|10) + |ll)). (10) 
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The final state ^ gives Ha = tr(f7A|Vv)(-0/|) = 3(" + 
(5 + /9 + 7), that is, 

{n^, IIb} ^ {\{a + S + P + j),^ia + p + S + 7)}.(11) 

Equation Hll() is always superior to the classical equilib- 
rium {j{a + 7), j{a + 7)}. But the value 6 equal to — /3, 
the skew-symmetric case, only gives classical payoff. If 
the each player choose another strategy Uk = o^z, then 
the final state will be 

IV"/) = ^JH^c^a ® (t,b)(|oo) + i\n)) = |oo) (12) 

and the expected payoff is {flAjIIs} = {a, a}. 
This value is the new Nash equilibrium within the 
quantum strajtegy, i.e., {11^ (cr^, cr^), nsCo-^, cr^)} > 
{n^ (cTz, C^s), lis (C/^A, Cz)} for all Uk. We can conclude 
that the allowed SB's quantum strategies always predict 
the human's choice (diagonal elements), the solution of 
the game. Although the classical solution to the PD us- 
ing repeated game gives a equilibrium point in {a, a} |8| 
as like as Eisert et ai, the quantum solution to the PD is 
one-shot solvable. After the expected payoff is reported 
with the PD, we can confirm the solution of the NP un- 
ambiguously. 

B. Quantum NP Solution 

The quantum solutions to the PD, two-person com- 
plete information game, directly correspond to the NP, 
if we consider the game NP only its reduced one- 
person game. The Nash equilibrium change {7,7} to 
{i(a + S + l3 + '^), j{a + S + l3 + ^)} on the Hadamard 
strategy. Furthermore the strategy (7z creates the new 
optimal equilibrium {a, a}, corresponding to the classi- 
cal Pareto optimal. Therefore we can adapt the solution 
of the NP from PD, as the payoff Ilfc = a. Another 
words the human's unique choice Boxl is restricted by 



quantum mechanical SB's prophecy, maybe SB's predic- 
tion is always corrected as the diagonal elements in the 
given payoff matrix. This result also can be calculated W 
the convex combination of the human's pure choice j5(, 
let's the human choose the Boxl with a certain proba- 
bility p G [0, 1] and Box2 with a probability 1 ~ p, then 
the intermediate state of the NP (using the Hadamard 
transform H) is given by 

So that the expected payoff of the NP is IVa = \\oi+{2p— 
1)5 + {2p — l)/3 -I- 7], coincide with the previous value in 
the PD. If one player choose the quantum strategy, such 
as (7a; occur a previous situation, the Nash equilibrium 
= a, this is the predicted value by SB's Ub = a.. 

IV. CONCLUSION 

In this paper, we used the payoff matrix with arbi- 
trary variables and also described the performance of 
quantum strategies, which enable to represent classical 
strategies. Especially we regarded the one-person NP 
as a two-person PD problem and then, under quantum 
strategies, obtained more reasonable and uplifted solu- 
tions over classical strategies. In the same strategies, we 
found out the Nash equilibrium always lies along the fair 
values, regardless of the payoff matrix. That is, the win- 
ner is never determined by the payoff, but by the level 
of strategies. Therefore the Newcomb's paradox is not a 
paradox under the fair situation. 
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